Goto

Collaborating Authors

 great time


VIST-GPT: Ushering in the Era of Visual Storytelling with LLMs?

Gado, Mohamed, Taliee, Towhid, Memon, Muhammad, Ignatov, Dmitry, Timofte, Radu

arXiv.org Artificial Intelligence

Visual storytelling is an interdisciplinary field combining computer vision and natural language processing to generate cohesive narratives from sequences of images. This paper presents a novel approach that leverages recent advancements in multimodal models, specifically adapting transformer-based architectures and large multimodal models, for the visual storytelling task. Leveraging the large-scale Visual Storytelling (VIST) dataset, our VIST-GPT model produces visually grounded, contextually appropriate narratives. W e address the limitations of traditional evaluation metrics, such as BLEU, METEOR, ROUGE, and CIDEr, which are not suitable for this task. Instead, we utilize RoViST and GROOVIST, novel reference-free metrics designed to assess visual storytelling, focus - ing on visual grounding, coherence, and non-redundancy. These metrics provide a more nuanced evaluation of narrative quality, aligning closely with human judgment.


These games were indie smash hits – but what happened next?

The Guardian

It is now more or less impossible to put a precise figure on the number of video games released each year. According to data published by the digital store Steam, almost 19,000 titles were released in 2024 – and that's just on one platform. Hundreds more arrived on consoles and smartphones. In some ways this is the positive sign of a vibrant industry, but how on earth does a new project get noticed? When Triple A titles with multimillion dollar marketing budgets are finding it hard to gain attention (disappointing sales have been reported for Dragon Age: The Veilguard, the Final Fantasy VII remakes and EA Sports FC), what chance is there for a small team to break out?


Prime Day drops Google Nest devices to record-low prices

Engadget

Amazon Prime Day is usually a great time to pick up things for your home, particularly smart home tech. This year, a bunch of Google's Nest devices have been discounted, with many down to record-low prices. These gadgets are best for anyone who already lives within the Google ecosystem, especially those who already rely on the Google Assistant to help them get things done. You'll find a few Nest security cameras on sale for Prime Day, as well as video doorbells and Wi-Fi systems. If you're looking for even more Prime Day deals, check out Engadget's Prime Day hub where you'll find all of the best tech deals you can get for the shopping event this year.


Unveiling the Complexity of High-Dimensional Time Series Forecasting with PCA

#artificialintelligence

Summer is here and with that comes the perfect time to get outside and enjoy the sunshine. Whether you're a fitness enthusiast, an outdoor enthusiast, or just someone who loves to relax, there are plenty of activities to choose from when it comes to summertime fun. For fitness lovers, it's the perfect time to get outside and enjoy the warm weather. Hiking, running, biking, and swimming are all great options for workouts that can improve physical and mental health. For outdoor enthusiasts, summertime can be a great time to explore the great outdoors.


Ordered Attention for Coherent Visual Storytelling

Braude, Tom, Schwartz, Idan, Schwing, Alexander, Shamir, Ariel

arXiv.org Artificial Intelligence

We address the problem of visual storytelling, i.e., generating a story for a given sequence of images. While each sentence of the story should describe a corresponding image, a coherent story also needs to be consistent and relate to both future and past images. To achieve this we develop ordered image attention (OIA). OIA models interactions between the sentence-corresponding image and important regions in other images of the sequence. To highlight the important objects, a message-passing-like algorithm collects representations of those objects in an order-aware manner. To generate the story's sentences, we then highlight important image attention vectors with an Image-Sentence Attention (ISA). Further, to alleviate common linguistic mistakes like repetitiveness, we introduce an adaptive prior. The obtained results improve the METEOR score on the VIST dataset by 1%. In addition, an extensive human study verifies coherency improvements and shows that OIA and ISA generated stories are more focused, shareable, and image-grounded.


16 Great Deals on Our Favorite Video Games and Accessories

WIRED

In many parts of the United States, we're in the thick of summer and it's time to admit: It's not getting any cooler out. It would be a great time to go to the pool or the beach. Alternatively, if you'd rather weather the heat waves indoors with air conditioning, it's a great time to hunker down with some gaming deals. Don't see anything you like here? Don't forget to check out our other buying guides, including our guide to the Best Wireless Gaming Headsets or the Best Soundbars. Special offer for Gear readers: Get a 1-year subscription to WIRED for $5 ($25 off).


Google Engineer On Leave After He Claims AI Program Has Gone Sentient

#artificialintelligence

A Google engineer is speaking out since the company placed him on administrative leave after he told his bosses an artificial intelligence program he was working with is now sentient. Blake Lemoine reached his conclusion after conversing since last fall with LaMDA, Google's artificially intelligent chatbot generator, what he calls part of a "hive mind." He was supposed to test if his conversation partner used discriminatory language or hate speech. As he and LaMDA messaged each other recently about religion, the AI talked about "personhood" and "rights," he told The Washington Post. It was just one of the many startling "talks" Lemoine has had with LaMDA.


3 Top Artificial Intelligence Stocks to Buy in March

#artificialintelligence

Artificial intelligence (AI) is often used as a buzzword when companies are trying to sell their product. They often have some form of AI, but it really isn't as much of a game-changer as it is hyped up to be. However, three businesses with real AI products making a difference in the industry are Nvidia ( NVDA -2.46%), CrowdStrike ( CRWD -0.25%), and C3.ai ( AI -9.82%). This trio of stocks is highly diversified and gives investors three different avenues to approach an investment in AI. Nvidia provides the hardware powering AI technology, CrowdStrike uses AI in cybersecurity, and C3.ai's tools help enterprises predict the future across a massive organization. When deployed correctly, artificial intelligence can make a huge difference in a product, and each of these businesses achieves that.


NAREOR: The Narrative Reordering Problem

Gangal, Varun, Feng, Steven Y., Hovy, Eduard, Mitamura, Teruko

arXiv.org Artificial Intelligence

We propose the task of Narrative Reordering(NAREOR) which involves rewriting a given story in a different narrative order while preserving its plot, semantic, and temporal aspects. We present a dataset, NAREORC, with over 1000 human rewritings of stories within ROCStories in non-linear orders, and conduct a detailed analysis of it. Further, we propose novel initial task-specific training methods and evaluation metrics. We perform experiments on NAREORC using GPT-2 and Transformer models and conduct an extensive human evaluation. We demonstrate that NAREOR is a challenging task with potential for further exploration.


The 5 best Amazon deals you can get this Monday

USATODAY - Tech Top Stories

This Monday, save on pizza cutter wheels, robot vacuums, and more. If you make a purchase by clicking one of our links, we may earn a small share of the revenue. However, our picks and opinions are independent from USA Today's newsroom and any business incentives. Chances are, you share my sentiments exactly, especially if you spent your weekend by the beach (like me) or you stayed indoors where it's nice and cool. The point is, nobody likes Mondays.